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Naturalness considerations, together with the non-observation of superpartners of the Standard 
Model particles at the Large Hadron Collider (LHC) so far, favor supersymmetric (SUSY) models 
in which third generation squarks are significantly lighter than those of the first two generations. 
In such models, gluino pair-production is typically the dominant SUSY production process at the 
LHC, and it often leads to final states with multiple top quarks. Some of these top quarks may 
be relativistic in the lab frame, in which case their hadronic decays may produce "top jets". We 
propose that the recently developed techniques for tagging top jets can be used to boost sensitivity 
of the LHC searches for this scenario. For example, within the simplified model used for this study, 
we estimate that a search with 2 top-tagged jets can probe gluino masses of up to about 1 TeV at 
the 7 TeV LHC with 30 fb _1 integrated luminosity. 



Introduction — Recently, experiments at the Large 
Hadron Collider (LHC) have begun searching for new 
physics beyond the Standard Model (SM). Among the 
many theoretical ideas about the possible nature of this 
new physics, supersymmetry (SUSY) is the most popular 
one: it provides an appealing solution to the gauge hi- 
erarchy problem of the SM, contains an attractive dark 
matter candidate, and fits naturally in the framework 
of grand unification and string theory. SUSY models 
predict a number of new particles, "superpartners" of 
the known SM particles, which may be produced at the 
LHC. In the simplest SUSY models, all superpartners 
are odd under a discrete symmetry, R-parity, while all 
SM particles are R-even. This implies that the lightest 
SUSY particle (LSP) is stable, and that any other super- 
partner will decay to the LSP and one or more SM par- 
ticles. Cosmological considerations strongly prefer the 
LSP to be electrically neutral and uncolored, so that at 
the LHC the LSP passes through the detector without in- 
teractions, leading to an apparent transverse momentum 
imbalance, or "missing transverse energy" (MET). The 
presence of MET provides a distinct signature which can 
be used to distinguish SUSY events from the (far more 
numerous) SM backgrounds. 

At the time of writing, the LHC experiments have pre- 
sented searches for events with anomalous MET using 
a data set of approximately 1 fb _1 collected in 2010-11 
at the center-of-mass energy of y/s = 7 TeV. No evi- 
dence for anomalous MET has been found, and limits 
on superpartner masses have been set. Barring acci- 
dental features such as spectrum degeneracies, gluinos 
g and squarks of the first two generations have been 
ruled out for masses up to about 1 TeV [Ij. In models 
where all squarks have a common mass at some energy 
scale, this bound implies that a significant amount of fine- 
tuning would be necessary to accommodate the observed 
electroweak symmetry breaking scale [2]. On the other 
hand, fine-tuning can be avoided if the third-generation 
squarks, stops t and sbottoms b, are significantly lighter 
than qi 2 |-'> . The LHC bounds on third-generation 



squarks are quite weak: stops above 200-300 GeV are 
currently allowed. The only other superpartner whose 
mass is significantly constrained by naturalness is the 
gluino 0]; at present, gluinos above 600 GeV are allowed 
if decaying only via the 3rd generation. With this moti- 
vation, we will focus on a scenario where gluinos, third- 
generation squarks, and a neutralino LSP are the only 
particles relevant for the LHC phenomenology, with other 
squarks being too heavy to be produced. An explicit ex- 
ample of a complete theory realizing this spectrum is the 
"accidental SUSY" models of Refs. [5J. 

The lack of discovery so far also implies that traditional 
SUSY searches using the MET signature will become 
more difficult, since the large-MET tails of SM back- 
grounds will need to be calculated (or extrapolated) with 
increasingly high precision to obtain sensitivity to lower 
SUSY cross sections. This motivates the question: Can 
any handles other than MET be used to identify SUSY 
events in the presence of large SM backgrounds? In this 
Letter, we explore an alternative signature. Gluino cas- 
cade decays to the LSP via intermediate stops produce 
two top quarks, so that gluino pair-production events 
may result in final states with four tops [3 [8]. If the 
gluino-stop and stop-LSP mass differences are sufficiently 
large, each of these tops will typically be relativistic in the 
lab frame, and its hadronic decay products will be merged 
into a single jet. Recently, much work has been done 
on distinguishing such top jets from the usual hadronic 
jets using the energy distribution inside the jet, and sev- 
eral well-tested algorithms for "tagging" top jets are now 
available The original motivation was to search for 
decays of the Kaluza-Klein gluon in models with extra 
dimensions |10j : other proposed applications include a 
search for the string- Regge excitation of the gluon [llj . 
and a search for direct stop production in SUSY [12]. 
Here, we point out that this technique can also be used to 
search for the SUSY gluino, and is particularly promising 
in scenarios with a light third generation, since g decays 
to tops have large branching fractions in this case. 

Analysis Setup — In the spirit of the "simplified 
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model" approach [TH] E], we assume that a gluino g, one 
stop t, and a single neutralino \° are the only superpart- 
ners relevant for the LHC phenomenology. This is the 
minimal set of particles required to produce our signa- 
ture. In the Minimal Supersymmetric Standard Model 
(MSSM), this setup can be realized if the second stop 
and the left-handed sbottom are heavier than the gluino. 
(Note that naturalness considerations in the MSSM pre- 
fer spectra with a few hundred-GeV splitting among the 
two stop mass eigenstates [15].) If this is not the case, the 
branching ratios of the decays producing our signature 
would be reduced (e.g. from 1 to 2/3 if all three squarks 
are degenerate), resulting in a somewhat decreased rate, 
but qualitatively the picture is unchanged. We assume 
that the neutralino is the stable LSP, and set its mass to 
60 GeV throughout the analysis. The LHC signal is dom- 
inated by gluino pair-production, followed by the cascade 
decay 

g->i + i, i^tx , (1) 

or its charge conjugate. We assume that m(g) — m(t) > 
m tl m(t) — m(x a ) > ni t , so that all four tops in the event 
are on-shell. (It may be possible to relax one of these 
conditions, as long as the other one is satisfied strongly 
so that at least two tops in the event are boosted; we will 
not study that possibility here.) We compute gluino pair- 
production cross sections at next-to-leading order (NLO) 
using PR0SPIN0 [IB]. To study cut efficiencies, we gen- 
erate event samples for gluino pair-production followed 
by the decays ([T]) using MadGraph/MadEvent v5 1 . 3 . 27 
(MG/ME) [17] for a large set of parameters (m(g) , m(t)) . 
We then simulate top decays, showering and hadroniza- 
tion with PYTHIA 8 18 . To identify jets, we use the anti- 
kx algorithm implemented in the Fast Jet code [HTI 120j . 
Top tagging of jets in our sample is simulated using the 
implementation of the Hopkins algorithm |21j available 
at [20]. In the top tagger, we use two sets of parameters, 
"tight" and "loose" tags; they are defined precisely as in 
Ref. 0. 

We require at least 4 jets with pt > 100 GeV in each 
event, and require that some of the jets be top-tagged. 
(The optimal number of top-tagged jets required depends 
on the LHC energy and luminosity, see below.) In the sig- 
nal, tagged jets are typically due to hadronic decays of 
boosted tops, which produce 3 collimated partons that 
cannot be resolved. The backgrounds include SM pro- 
cesses with boosted tops, as well as ordinary jets mis- 
takenly tagged as top-jets. (The mistag probability is 
typically of order 1% [9 .) We also require the pres- 
ence of substantial missing energy. The irreducible back- 
grounds may contain MET from invisible Z decays, lep- 
tonic W decays, or semileptonic top decays. We include 
the following irreducible backgrounds: nt + (4 — n)j with 
n = 1 ... 4; Z + nt + (4 - n)j, with n = 0, 2, 4; and 
W + nt + (4 — n)j, with n = 0, 2, 4. Here each t may 
be a top or an anti-top, j denotes a jet due to a non- 
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FIG. 1: Signal at the benchmark point, (m(g),m(t)) = 
(800, 400) GeV, and background rates as a function of MET, 
at 7 TeV LHC. Four jets with p T > 100 GeV and two top- 
tagged jets are required. 



top quark or a gluon, and Z — > vv or W — > tv is re- 
quired. We do not include reducible backgrounds, other 
than the light jets mistagged as tops. We simulated the 
backgrounds at parton level with MG/ME, and used these 
samples to compute pr and MET cut efficiencies. We 
use leading-order (LO) cross sections for all background 
processes. The two dominant backgrounds, 2t + 2j and 
Z + 4j, have been recently computed at NLO. In both 
cases, the NLO correction to the cross section is negative: 
K-factors of 0.73 for 2t + 2j 22\ and 0.95 for Z + 4? [23J 
have been reported, so that using LO cross sections for 
these processes is conservative. No other backgrounds 
are currently known beyond the LO. 

Unfortunately, due to large QCD rates and small 
mistag probabilities, we were not able to generate Monte 
Carlo samples large enough to measure top-tag efficien- 
cies directly in the background channels. Instead, we es- 
timate these efficiencies by multiplying the py-dependent 
tag and mistag probabilities for individual top and non- 
top jets reported in Ref. [9J. This estimate assumes that 
the tag and mistag probabilities for each jet are indepen- 
dent of the presence of other objects in the final state 
(the probabilities in [3] were computed using ti and 2j 
samples). The probability to tag a true top jet as such 
is clearly reduced by the presence of other jets in the 
event: for example, the tag efficiency for our signal ap- 
proximated in this way is typically about a factor of two 
higher than that obtained by a full simulation. So, our 
estimate of backgrounds involving tops, such as 2t + 2j, 
is certainly conservative. It is less clear how the mis- 
tag probability would be affected; we leave this issue for 
future work. 

LHC Sensitivity at ^/s — 7 TeV — To keep the anal- 
ysis simple, we optimize the selection cuts for a single 
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Process 


Ctot 


Eff(p T ) 


Eff(tag) 


Ctag 


Eff(#r) 


<^all cuts 


signal 


61.5 


37 
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1.31 


81 


1.06 


Z + Aj 


2 x 10 5 


0.2 


0.1 


0.44 


66 


0.29 


It + 2j 


5 x 10 4 


3 


0.3 


5.7 


2 


0.10 


W + 4:j 


2 x 10 5 


0.2 


0.03 


0.12 


29 


0.04 


Z + 2t + 2j 


50 


4 


1 


0.02 


72 


0.02 



TABLE I: Signal and background cross sections (in fb) and 
cut efficiencies (in %) at the 7 TeV LHC. Acceptance cuts of 
Pt > 20 GeV, \n\ < 5 for all jets are included in the total cross 
sections. The cuts are labelled as follows: u Pt"'- requiring 4 
jets with pr > 100 GeV; "tag": requiring 2 jets to be tagged 
as tops with "loose" parameters; requiring fr > 100 

GeV. The signal is at the benchmark point, (m(g),m(t)) = 
(800, 400) GeV. Backgrounds not listed here are negligible. 



"benchmark" point in the model parameter space, and 
do not vary them as we scan the masses. At 7 TeV, we 
choose the benchmark point (m(g),m(t)) = (800,400) 
GeV. We studied all possible combinations of between 
and 4 loose and tight top tags, and conclude that requir- 
ing 2 loose tags is the best strategy at this point. Anal- 
yses requiring more than 2 tags, or 2 or more tight tags, 
suffer from low event rate, making a search in the 7 TeV 
LHC run with 20 — 30 fb -1 integrated luminosity imprac- 
tical. Requiring fewer tags leads to significantly higher 
background rates, decreasing sensitivity [23]. The two 
top tag requirements strongly suppress the backgrounds, 
as illustrated in Table |TJ but are not by themselves suf- 
ficient, so that an additional MET cut must be applied. 
The signal and principal backgrounds as a function of 
MET are shown in Fig. [I] We require > 100 GeV; 
with this cut, we expect 32 signal events, S/B = 2.4, 
and statistical significance of 6.8 at the benchmark point 
with 30 fb -1 integrated luminosity. The reach of the LHC 
with this data set is shown in Fig. [2] (The 95% exclusion 
contour is calculated using the expected CL S 25 . The 
discovery significance is determined using the expected 
log likelihood of consistency with the signal plus back- 
ground hypothesis |26|.) Gluino masses of up to about 1 
TeV can be probed at the 95% confidence level, as long as 
the gluino-stop mass difference exceeds 400 GeV. The 5- 
sigma discovery reach extends to a gluino mass of about 
900 GeV for stop masses below 350 GeV. We should also 
note that S/B > 1 throughout the probed region, so no 
extraordinarily precise predictions of the background are 
required. 

LHC Sensitivity at ^/s = 14 TeV — Anticipating 
higher reach of the search at 14 TeV, we optimize the 
selection cuts for a benchmark point with higher masses, 
(m(g),m(t)) = (1200,600) GeV. After again considering 
all possible combinations of loose and tight tag require- 
ments, we conclude that the optimal strategy in this 
case is to require three loose tags. We further require 
tfjr > f 75 GeV. At the benchmark point, we expect 8.5 
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FIG. 2: The 95% c.l. expected exclusion and 5-sigma discov- 
ery reach of the proposed search at the 7 TeV LHC run with 
30 fb -1 integrated luminosity. 
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FIG. 3: The 95% c.l. expected exclusion and 5-sigma discov- 
ery reach of the proposed search at the 14 TeV LHC run with 
10 fb -1 integrated luminosity. 

signal events to pass these cuts in a data set of 10 fb -1 , 
and with S/B = 27.5 the expected statistical significance 
of observation is 6.5. The reach of a search with these 
parameters is shown in Fig. [3j Discovery is possible up to 
1.3 — 1.4 TeV gluino masses with stops in the 300 — 700 
GeV mass range. In this case, S/B > 10 throughout 
the discovery region. 

Given how effective the top tagging technique is in sup- 
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pressing backgrounds, it is natural to wonder whether, 
given enough data, a search for gluinos could be con- 
ducted with no MET requirement at all. Unfortunately, 
this is not possible. While the backgrounds studied above 
are sufficiently suppressed, a new irreducible background, 
pure QCD events with 4 hard jets (pr > 100 GeV), must 
be included in the absence of a MET cut. The rate for 
this process is so large (5.3 nb at 14 TeV at tree-level) 
that, even including the small mistag probabilities for 
light jets, it overwhelms the signal. We estimate that the 
most sensitive search without a MET cut is again one 
with 3 loose top tags required. For a 300 fb _1 data set, 
this search is sensitive to the benchmark point at about 
4.5 sigma level (statistics-only), but with S/B ~ 0.1, sys- 
tematic errors are probably too large to claim sensitivity. 

Discussion — Our analysis indicates that using tagged 
top jets as an additional handle to suppress SM back- 
grounds in the search for gluino decaying to stops leads 
to interesting reach, even in the 7 TeV run. In fact, the 
reach may be even higher than we estimate, since we did 
not perform a thorough cut optimization for various re- 
gions of the model parameter space, instead simply freez- 
ing the cuts to values that were found to be near-optimal 
for a single benchmark point. 

While we made several simplifications in this ex- 
ploratory study, the promising results in our opinion jus- 
tify a more complete analysis. Most of the outstanding is- 
sues concern backgrounds. For irreducible backgrounds, 
the fixed-order (tree-level) simulations used here should 
be supplemented with showering and hadronization, al- 
though since the jets used in our analysis are required 
to have rather high pr, we do not expect qualitative 
changes. Also, MC samples with higher statistics should 
be used to fully simulate top-tagging efficiencies on the 
backgrounds. Reducible backgrounds, which were ig- 
nored here, should be studied. The most important one 
of these is the pure QCD channel, 4j at parton level, 
which has a very high rate even with a 2 or 3 mistagged- 
jet requirement. The pure-QCD events passing our cuts 
lie far on the tail of the MET distribution for this chan- 
nel, where the MET is entirely due to undetected or in- 
correctly measured jets. Correctly estimating this back- 
ground would thus be a task for a complete detector sim- 
ulation or a data-driven approach, which must be per- 
formed by the experimental collaborations. It is impor- 
tant to note, however, that large-MET QCD tails affect 
all SUSY searches at the LHC relying on MET, and in the 
purely hadronic searches this effect is typically subdomi- 
nant to the reducible backgrounds once appropriate cuts 
are applied to eliminate events with MET aligned with 
one of the jets Q] • Similar techniques can be applied in 
our case. 

Conclusions — If SUSY is realized in such a way that 
stops and sbottoms are the only squarks below the TeV 
scale, as favored by naturalness and recent negative re- 
sults from the LHC, top-rich final states are a natural 



place to search for it. Our results indicate that the tech- 
niques to separate top jets from light jets, developed re- 
cently with a completely different motivation, can be 
employed to boost sensitivity of such searches. They 
can complement other proposed strategies for this sce- 
nario [HI HZ] , especially in the heavy gluino region. We 
encourage the experimental collaborations to incorporate 
this tool in the upcoming searches. 
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